NSF PAR Search | NSF Public Access Repository

Decalf: A Directed, Effectful Cost-Aware Logical Framework

https://doi.org/10.1145/3632852

Grodin, Harrison; Niu, Yue; Sterling, Jonathan; Harper, Robert (January 2024, Proceedings of the ACM on Programming Languages)

We present decalf, a directed, effectful cost-aware logical framework for studying quantitative aspects of functional programs with effects. Like calf, the language is based on a formal phase distinction between the extension and the intension of a program, its pure behavior as distinct from its cost measured by an effectful step-counting primitive. The type theory ensures that the behavior is unaffected by the cost accounting. Unlike calf, the present language takes account of effects, such as probabilistic choice and mutable state. This extension requires a reformulation of calf’s approach to cost accounting: rather than rely on a “separable” notion of cost, here a cost bound is simply another program. To make this formal, we equip every type with an intrinsic preorder, relaxing the precise cost accounting intrinsic to a program to a looser but nevertheless informative estimate. For example, the cost bound of a probabilistic program is itself a probabilistic program that specifies the distribution of costs. This approach serves as a streamlined alternative to the standard method of isolating a cost recurrence and readily extends to higher-order, effectful programs. The development proceeds by first introducing the decalf type system, which is based on an intrinsic ordering among terms that restricts in the extensional phase to extensional equality, but in the intensional phase reflects an approximation of the cost of a program of interest. This formulation is then applied to a number of illustrative examples, including pure and effectful sorting algorithms, simple probabilistic programs, and higher-order functions. Finally, we justify decalf via a model in the topos of augmented simplicial sets.
Full Text Available
A Metalanguage for Cost-Aware Denotational Semantics

https://doi.org/10.1109/LICS56636.2023.10175777

Niu, Yue; Harper, Robert (June 2023, 2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS))

We present metalanguages for developing synthetic cost-aware denotational semantics of programming languages. Extending recent advances by Niu et al. in cost and behavioral verification in dependent type theory, we define two successively more expressive metalanguages for studying cost-aware metatheory. We construct synthetic denotational models of the simply-typed lambda calculus and Modernized Algol, a language with first-order store and while loops, and show that they satisfy a cost-aware generalization of the classic Plotkin-type computational adequacy theorem. Moreover, by developing our proofs in a synthetic language of phase-separated constructions of intension and extension, our results easily restrict to the corresponding extensional theorems. Consequently, our work provides a positive answer to the conjecture raised in op. cit. and contributes a framework for cost-aware programming, verification, and metatheory.
Full Text Available
mL-BFGS: A Momentum-based L-BFGS for Distributed Large-Scale Neural Network Optimization

Niu, Yue; Fabian, Zalan; Lee, Sunwoo; Soltanolkotabi, Mahdi; Avestimehr, Salman (July 2023, Transactions on Machine Learning Research)

Quasi-Newton methods still face significant challenges in training large-scale neural networks due to additional compute costs in Hessian-related computations and instability issues in stochastic training. A well-known method, L-BFGS, which efficiently approximates the Hessian using the history of parameter and gradient changes, suffers from convergence instability in stochastic training. So far, attempts to adapt L-BFGS to large-scale stochastic training incur considerable extra overhead, which offsets its convergence benefits in wall-clock time. In this paper, we propose mL-BFGS, a lightweight momentum-based L-BFGS algorithm that paves the way for quasi-Newton (QN) methods in large-scale distributed deep neural network (DNN) optimization. mL-BFGS introduces a nearly cost-free momentum scheme into the L-BFGS update and greatly reduces stochastic noise in the Hessian, thereby stabilizing convergence during stochastic optimization. For model training at a large scale, mL-BFGS approximates a block-wise Hessian, thus enabling compute and memory costs to be distributed across all computing nodes. We provide a supporting convergence analysis for mL-BFGS in stochastic settings. To investigate mL-BFGS’s potential in large-scale DNN training, we train benchmark neural models using mL-BFGS and compare performance with baselines (SGD, Adam, and other quasi-Newton methods). Results show that mL-BFGS achieves noticeable iteration-wise and wall-clock speedups.
Full Text Available
Equivariant variance estimation for multiple change-point model

https://doi.org/10.1214/23-EJS2190

Hao, Ning; Niu, Yue Selena; Xiao, Han (January 2023, Electronic Journal of Statistics)

Full Text Available
A cost-aware logical framework

https://doi.org/10.1145/3498670

Niu, Yue; Sterling, Jonathan; Grodin, Harrison; Harper, Robert (January 2022, Proceedings of the ACM on Programming Languages)

We present calf, a cost-aware logical framework for studying quantitative aspects of functional programs. Taking inspiration from recent work that reconstructs traditional aspects of programming languages in terms of a modal account of phase distinctions, we argue that the cost structure of programs motivates a phase distinction between intension and extension. Armed with this technology, we contribute a synthetic account of cost structure as a computational effect in which cost-aware programs enjoy an internal noninterference property: input/output behavior cannot depend on cost. As a full-spectrum dependent type theory, calf presents a unified language for programming and specification of both cost and behavior that can be integrated smoothly with existing mathematical libraries available in type theoretic proof assistants. We evaluate calf as a general framework for cost analysis by implementing two fundamental techniques for algorithm analysis: the method of recurrence relations and physicist’s method for amortized analysis. We deploy these techniques on a variety of case studies: we prove a tight, closed bound for Euclid’s algorithm, verify the amortized complexity of batched queues, and derive tight, closed bounds for the sequential and parallel complexity of merge sort, all fully mechanized in the Agda proof assistant. Lastly we substantiate the soundness of quantitative reasoning in calf by means of a model construction.
A Super Scalable Algorithm for Short Segment Detection

https://doi.org/10.1007/s12561-020-09278-z

Hao, Ning; Niu, Yue Selena; Xiao, Feifei; Zhang, Heping (April 2020, Statistics in Biosciences)
Full Text Available
Automatic Space Bound Analysis for Functional Programs with Garbage Collection

https://doi.org/10.29007/xkwx

Niu, Yue; Hoffmann, Jan (October 2018, EPiC Series in Computing)

This article introduces a novel system for deriving upper bounds on the heap-space requirements of functional programs with garbage collection. The space cost model is based on a perfect garbage collector that immediately deallocates memory cells when they become unreachable. Heap-space bounds are derived using type-based automatic amortized resource analysis (AARA), a template-based technique that efficiently reduces bound inference to linear programming. The first technical contribution of the work is a new operational cost semantics that models a perfect garbage collector. The second technical contribution is an extension of AARA to take into account automatic deallocation. A key observation is that deallocation of a perfect collector can be modeled with destructive pattern matching if data structures are used in a linear way. However, the analysis uses destructive pattern matching to accurately model deallocation even if data is shared. The soundness of the extended AARA with respect to the new cost semantics is proven in two parts via an intermediate linear cost semantics. The analysis and the cost semantics have been implemented as an extension to Resource Aware ML (RaML). An experimental evaluation shows that the system is able to derive tight symbolic heap-space bounds for common algorithms. Often the bounds are asymptotic improvements over bounds that RaML derives without taking into account garbage collection.
Full Text Available
A New Reduced-Rank Linear Discriminant Analysis Method and Its Applications

https://doi.org/10.5705/ss.202015.0387

Niu, Yue Selena; Hao, Ning; Dong, Bin (January 2018, Statistica Sinica)

Full Text Available
Interaction screening by partial correlation

https://doi.org/10.4310/SII.2018.v11.n2.a9

Niu, Yue Selena; Hao, Ning; Zhang, Hao Helen (January 2018, Statistics and Its Interface)

Full Text Available
An accurate and powerful method for copy number variation detection

https://doi.org/10.1093/bioinformatics/bty1041

Xiao, Feifei; Luo, Xizhi; Hao, Ning; Niu, Yue S; Xiao, Xiangjun; Cai, Guoshuai; Amos, Christopher I; Zhang, Heping; Hancock, John (January 2019, Bioinformatics)

Full Text Available
